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ABSTRACT 

We present a search for non-Gaussianity in the WMAP first-year data using the two- 
point correlation function of maxima and minima in the temperature map. We find 
evidence for non-Gaussianity on large scales, whose origin appears to be associated 
with unsubstracted foregrounds, but which is not entirely clear. The signal appears to 
be associated most strongly with cold spots, and is more pronounced in the Southern 
galactic hemisphere. Removal of the region of sky near the galactic plane, or filtering 
out large-scale modes removes the signal. Analysis of individual frequency maps shows 
strongest signal in the 41GHz Q band. A study of difi^erence maps tests the hypoth- 
esis that the non-Gaussianity is due to residual foregrounds and noise, but shows no 
significant detection. We suggest that the detection is due to large-scale residual fore- 
grounds affecting more than one frequency band, but a primordial contribution from 
the Cosmic Microwave Background cannot be excluded. 
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The hypothesis of primordial statistical random Gaussian- 
ity lies at the core of our understanding of the Universe. 
The currently favoured standard cosmological model is built 
upon the assump tion of a set of initial conditions laid down 
during Inflation In the most simple model of 

Inflation, these initial conditions produce nearly Gaussian 
temperature fluctuations of the Cosmic Microwave Back- 
ground (CMB). If the CMB temperature held is indeed 
Gaussian distributed, its two-point angular correlation func- 
tion (or power spectrum in dual space) contains all the in- 
formation needed to characterise its statistical properties 
fully. In the Inflation paradigm and the hypothesis of sta- 
tistical isotropy (i.e. invariance under rotations of n-point 
correlation functions), the angular power spectrum Ce nat- 
urally became the standard statistical tool from which to 
extract all the cosmological information present in the CMB 
data J^ngman ct al. 1996). If, on the contrary, the mi- 
crowave background radiation is significantly non-Gaussian, 
it would be powerful evidence against Infiation with far- 
reaching consequences for our understanding of structure 
formation. 

As a consequence, testing the precise statistical nature 
of the CMB anisotropics is of extreme importance in or- 
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der to test the validity of our present theories of structure 
formation and of our current methods of parameter esti- 
mation. These statistical tests are complicated by the in- 
evitable presence of foregrounds, such as the emission from 
our own galaxy or extra-galactic point sources and system- 
atic and instrumental effects which may introduce spuri- 
ous non-Gaussianities in the measurements. At small an- 
gular scales, sec ondary anisotropies such as the Sunyaev- 
Zel'dovich effect JSunvaev fc Zeldovicblll972l). the Ostriker- 
Vishniac effect JOstriker fc VishniaJl986l:ICastr J2003l) . the 
Rees-Sciama effcct"^£C^^Sciama''w6^ and lensing by in- 
tervening struc tures (Blanchard & Schneider 1987; Hu 20o3; 
ICoorav fc Ke sden 2003) a s wel l as cross corr el ations be- 
tween them dCoorav fc 51120001: ICooravll2001allbtlKomatsul 
l200lt IVerde fc SDergelll2002l) ~l also leave their own non- 
Gaussian signatures; these a valuable source of cosmological 
and astrophysical information in their own right. 

Due to the scientific importance of measuring any 
non-Gaussianity, but also to the difficulty of detecting 
it, many non-Gaussianity studies have been performed 
at various angular scales since the first CMB measure- 
men ts by the C osmic Background Explorer (COBE) satel- 
lite llSmoot et al. 1992). Remarkably COBE provided the 
first non-Gaussianity detection by means of the b ispectrum 
jFerreira. Maeueiio fc G6rskilll998l:lHeavenj|l998l) . a detec- 
tion that was la ter claimed to be due to undocumented sys- 
tematic effects iBandav. Zaroubi fc Gorskil I2OO0I : iMaeueiil 
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l2000l:lMagueiio fc Medeirdl2004) . With the advent of higher 
sensitivity and higher resolution experiments, in particu- 
lar the W ilkinson Microwave Anisotropy Probe (WMAP; 
iBennett'et al. 2003.) . with its low noise and large sky cover- 
age, the pursuit of non-Gaussianity has become one of the 
main objectives of the CMB field. 

As non-Gaussianity can take innumerable forms, it is 
difficult to test its presence and there is no optimal general 
test for detecting and quantifying it. Different tests can show 
higher or lower sensitivity depend ing on the type of no n- 
Gaussianity being tested (see e.g. lAghanirn et alJ J2003M . 
Clearly, it is important to apply as many different estima- 
tors as possible as the eventual understanding of any devi- 
ation from Gaussianity will likely come from a combination 
of complementary results. 

A multitude of non- Gaussian estimators have thus 
been applied to the WMAP first year data such as the 
bispectrum, Minkowski functionals, three-point angular 
correlation function, higher-order moments of the angular 
correlation function, wavelets, phase correlations, ten- 
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show a variety of anomalies. The first of them concerns 
the low value of the quadrupole (multipole £ — 2), previ- 
ously observed in the COBE data and whose origin has 
been discussed in terms of super-horizon fluctuat io ns an d 
spatial curvature e.g. |Berer^ Fane fc Hinshawl Jl99dl : 
iBerera fc Heavenj (I2OO0I) : lEfstathioul ll2003h . Subseauentlv. 
studies indicated an unusual low probability planarity and 
alignment of the quadrupole and of the octopole {£ = 3), 
maybe even extending to higher multipole values {i = 5). 
In addition, an analysis pe rformed with Spherical Mexican 
Hat Wavelets jCruz et al.l 120051 provided the detection of 
a very non-Gaussian structure in the data, associated to 
a cold spot of around 10 degrees, visible in the WMAP 
temperature maps. Last but not least, the data appear to 
exhibit a North-South asymmetry on large scales. Notori- 
ously, all these claims indicate not only a deviation from 
the Gaussianity hypothesis but most alarming a violation 
of statistical isotropy, one of the two fundamental pillars - 
together with statistical homogeneity- of the Cosmological 
Principle. A plausible explanation for these surprising 
results is likely to be simply the presence of unaccounted 
systematics. But one cannot put aside the possibility of 
there being of cosmological origin, in which case it would 
have far-reaching consequences for our perception of the 
Universe and thus for the standard Cosmological model. 

The peak-peak correlation function of a Gaussian ran- 
dom field has been calculated exactly, both in the full- 
sky and in the flat-sky approximat ion, and depends, as it 
must, only on the powe r spectrum jHeavens fc ShetlallQQgl : 
iHeavens fc Guptal200lll . It has been shown to display a rich 
structure, in particular at small angular scales, raising the 
hope that it has a good sensitivity to a global non-Gaussian 
signal. The present study uses an estimator of the 2-point 
correlation function of the temperature hot spots (local max- 
ima) and cold spots (local minima) to look for any devia- 



tion from the Gaussian hypothesis in the WMAP first-year 
temperature data. Following the claims of a North-South 
asymmetry, we also analyse both hemispheres separately. 

This paper is organised as follows. In Section |5| and 
we explain in detail our methodology, including the map 
construction pipeline, the peak-peak correlation function 
estimator used and the statistical method chosen to test 
robustly the Gaussianity of the data. In Section 2] we ap- 
ply our method to the WMAP first year maps and present 
our results. In order to test various statistical properties 
of the whole data, we performed a full-sky study and a 
North-South comparison analysis of a map resulting from 
combining all the different frequency assemblies maps pro- 
vided by WMAP. In particular we investigated how this 
North-South comparison changes with cuts both in har- 
monic and real space. To further test for the origin of any 
non-Gaussianity, we present both a full-sky and a North- 
South analysis performed on single-frequency maps and on 
frequency-difference maps, which removes the primordial 
signal and tests noise and residual foregrounds. Finally, in 
Section |n] we present our conclusions. 



2 METHOD 

The two-point correlation function of peaks of a Gaussian 
random field can be calculated analytica lly, given only its 
power spectrum llHeavens fc Guptall200lh . 

The powe r spectrum has bee n estimated from the 
WMAP data jHinshaw et alJl2003l) . and we test the hy- 
pothesis that the field giving rise to this power spectrum is 
Gaussian. We do this by taking a brute force approach and 
use the best-fit ACDM theoretical power spectrum to the 
WMAP data as the starting point to generate a large num- 
ber of Gaussian maps at all WMAP frequencies, to which 
we then apply sky cuts, window functions and noise so as to 
mimic the real data. 

In this way, we estimate the hot spot and cold spot 
correlation functions, and their covariance, for the Gaussian 
maps created with the same algorithm as the real data. We 
then use these to test the Gaussian hypothesis. 



2.1 The maps 

2.1.1 The WMAP map 

The WMAP satellite probed the CMB at five different fre- 
quencies with two radiometers, producing ten differencing 
assemblies (DAs): four on the W-band (94GHz), two on the 
V-band (61GHz), two on the Q-band (41GIIz), one on the 
Ka-band (33GHz) and one on the K-band (23GIIz). Each 
of these assemblies, after calibration and removal of the 
monopole and dipole, is available for download from the 
WMAP web-site\ AU the maps are provided in the Hi- 
erarchical Equal Area isoLatitude Pixelisation (HEALPix) 
scheme^, which has proved to have several advantages over 
other methods of pixelising the surface of a sphere, in par- 
ticular the fact that the pixel area is kept constant through- 
out the surface of the sphere. However, the pixel shapes 



^ http:/ /lambda. gsfc.nasa.gov/product/map/m_products.cfm 
^ http:/ /www. eso.org/science/healpix/ 
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can vary largely between the equatorial and polar regions 
and distance between pixel centres is not kept constant. The 
HEALPix scheme divides the sphere surface into 12 faces of 
4 sides each, giving a minimum resolution of 12 pixels. Each 
side is divided in N^ide pixels, giving a total number of pix- 
els in a map of 12iV2^,. The WMAP maps were provided 
at a resolution of Ngide = 512 giving a total of 3,145,728 
pixels separated on average by Opi:^ = 0.115 degrees = 6.87 
arc minutes. 

Each DA map pixel p contains the temperature field 
(in mK) and a field containing the number of observations, 
Nobs(p), which allows the noise per pixel to be estimated 
using 



•y/ NobB(p) 



(1) 



where ao is the noise dispersion per map and which has been 
publi shed for each of the different assemblies iHinshaw et alJ 
l2003l) . Also available is a foreground-cleaned map of each 
of the DAs (for details concernin g the removal of the fore- 
grounds see iBennett et aD i2003l) ). from which a Galactic 
foreground template has been removed, consisting of syn- 
chrotron, free-free and dust emission. 

The WMAP map used in this work is a linear combina- 
tion of the eight foreground-cleaned assemblies in the Q, V 
and W bands (the Q VW map). We f o llow t he construction 
method suggested by iKomatsu et al.l (|200^ and produce a 
temperature map given by 



Tqvw{p) = 



E 



,(p) 



with the weights being given by 
1 

Wj{p) 



]{p) 



(2) 



(3) 



The index j corresponds to the different DAs: j — 1,2 cor- 
responds to the V band, j = 3, 4 to the Q band and j = 5 
to 8 to the W band. 

This map proved to be well suited for this work. 
Compared to other foreground-cleaned maps such as 
the Internal Linear Combination (ILC) map published 
by the WMAP team and the Tegmark Clean Map 
jTeemark. Oliveira-Costa fc Hamiitoiil l2003h (TCM), the 
QVW map keeps all the small scale structure by not smooth- 
ing the DAs to a common resolution before construction 
(also keeping the noise per pixel nicely uncorrelated) and 
shows less noise power at high multipoles. And since all the 
weights are known in the QVW map, it also becomes trivial 
to construct Gaussian simulations of this map. 
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Figure 1. The power spectrum of our working maps. WMAP 
data in the solid line (black in the online version) and one of our 
Gaussian maps in the dashed line (red in the online version). 



• We add uncorrelated noise to each of the maps accord- 
ing to equation Q (a more accurate noise model is used for 
difference maps). 

• We combine the 8 resulting maps using equations |(2J 
and 

We repeat this procedure to create many Gaussian simula- 
tions of the CMB, each being a random Gaussian realisation 
of the same initial power spectrum. We used different num- 
bers of Gaussian maps in different types of analysis, and 
we quote each number within the appropriate section. The 
maps are time-consuming to produce but in each case we 
check convergence of (see Figures 1^ 1121 and ll6l for exam- 
ples). A comparison of the power spectrum of the real and 
a simulated map can be seen in Figure 

Although at small angular scales the noise properties 
are white, fully understood and easily modelled (see discus- 
sion in section 2.1.1), at large angular scales, individual Q, 
V and W assemblies present noise characteristics which are 
non-white. Fortunately these are entirely dominated by the 
sig nal and one does not n eed to worry about them (see Fig. 1 
of iHinshaw efsl] J2003l) ). The WMAP team have produced 
a set of 110 noise maps which include white noise (dominat- 
ing at small scales), 1/f noise (dominating at large scales) 
and inter-channel correlations for each of the radiometers, 
and ideally one would like to incorporate all known effects 
into the analysis. However, being limited by the relatively 
small number of full noise simulations and due to the high 
signal to noise ratio at the scales where the noise properties 
deviate from white, we choose to include only white noise in 
our Gaussian QVW and single frequency maps. 



2.1.2 The Gaussian maps 

To construct Gaussian simulations of the CM B, we follow 
the method suggested bv lKomatsu et aTl i2003l) and proceed 
in the following way: 

• We generate one sky realisation from the best fit ACDM 
model power spectrum, published in the WMAP web-site. 

• We copy this map 8 times, one for each assembly, and 
convolve each of the copies with the appropriate beam trans- 
fer function, again published in the WMAP web-site. 



2.1.3 A more extensive set of masks 

In addition to the masks published in the WMAP web-site, 
a new set of masks was constructed, based on the WMAP 
team's kpO mask which masks the galactic plane plus all 
known point sources in the sky. The initial motivation 
came from the need to work with smoothed maps. The 
QVW maps (real and Gaussian) show a large number of 
peaks at full resolution which creates a problem concerning 
the computational time taken to calculate the peak-peak 
correlation function. So a trade-off between resolution and 
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Table 1. The new and original masks 



Mask name 



kpO 
SOB 
30B_knorth 
30B_ksouth 
bcut30_knorth 
bcutSO-ksouth 



[61 cut Percentage of removed sky 
(degrees) 

23 
28 
64 
64 

30 76 
30 77 



random catalogues of 400,000 points for full sky and 200,000 
points for single hemispheres, with the same sky cut as the 
appropriate WMAP map, and ensure that the estimator has 
converged to a stable value. A hot spot (cold spot) is defined 
for the purposes of this analysis as the centre of any pixel 
whose temperature is higher (lower) than the temperature 
of all pixels with which it shares a boundary. 

The correlation function is estimated in 300 equally- 
spaced bins up to a maximum separation of 1800 arc min- 
utes. 



computational limitations was reached and this consisted 
of applying an additional Gaussian beam of FWHM — 
12 arc minutes to the raw maps (real and Gaussian), as 
constructed in Section 2.1.1. Because some of the unwanted 
(masked) signal could leak from the masked regions due 
to this smoothing, the mask kpO was also increased (by 
effectively smoothing it with the same smoothing kernel 
applied to the data) to create a new mask, dubbed mask 
SOB. Regions in the smoothed mask with values greater 
than 0.01 were masked, where the raw mask map has values 
zero and one, with one meaning masked. 

In addition to this extension, the new mask was 
further modified to include either only the Northern or 
the Southern hemisphere and also to optionally exclude a 
region close the galactic plane, |6| < 30°. 

A summary of the properties of each of the new masks 
is given in Tabled 



2.2 Peak-peak correlation function 

The two-point correlation function ^ of a discrete distribu- 
tion of points can be defined as 



dP ^ n{i + ^{e))dn 



(4) 



where dP is the probability of finding a data point within 
a solid angle dil, at a distance 9 from another randomly 
chosen data point and n is the average density of points in 
the sample. So ^{6) can be seen as an excess of probability 
of finding a pair at a distance 0, compared to a random 
catalogue - it is a measurement of clustering - and ^(S) =0 
indicates a uniform random distribution (no clustering). 

There are several estimators suggested in the literature to 
estimate ^{6) directly from the data. They all work by com- 
paring the sample of points to an uniform, random catalogue 
wit h the same spatia l distribution as the real data. We used 
the iHamiltonl il993h estimator, which promises fast conver- 
gence: 



RR{e).DD{e) 
DR{ey 



(5) 



where RR{9) and DD{6) are the number of random and 
data pairs respectively at a distance 9 from each other and 
DR{9) is the number of cross-pairs separated by a distance 9 
(all weighted by the number of total random, data and cross 
pairs in the catalogue) . Indeed, we found it to converge faster 
than the standard estimator, ^{9) — ^^^^^j — 1. We use large 



3 COMPARISON WITH GAUSSIAN MAPS 

We test the Gaussianity hypothesis of the WMAP data by 
comparing our estimator of the peak-peak correlation func- 
tion ^{9) applied to the data with its values when applied to 
a set of synthetic Gaussian maps with the same cosmology 
as the WMAP best-fit. 

As explained in the previous section, we apply an ad- 
ditional Gaussian beam of FWHM of 12 arc minutes to our 
real and Gaussian maps, constructed as in section 2.1.1 and 
2.1.2, to smooth out some of the small scale noise and make 
the number of peaks slightly more manageable. We mask all 
maps with the mask 30B and we select all maxima with tem- 
perature above a given threshold v and all minima below —i/ 
(u in units of the map rms, cr). We estimate the peak-peak 
correlation function using equation 

We choose to quantify the robustness of any non- 
Gaussianity detection by means of the statistic, which 
we compute for the WMAP data and for each of the Gaus- 
sian maps: 



X 



sian maps available: 



(6) 



(7) 



Here i,j are bins at a given angular separation and the 
is the average of our estimator over all Gaussian maps of 
that particular bin. Previously to computing equations 
and we rebin all data to 19 bins, of which we discard the 
first one^. Rebinning is necessary, otherwise dj is close to 
singular and numerically unstable to inversion. 

For Gaussian and uncorrelated errors we recall that the 
99% confidence level for a reduced distribution with 18 
degrees of freedom comes at < 0.38 and > 1-93- We 
also use the distribution estimated directly from all of the 
Gaussian maps to give more empirical values for confidence 
levels - see Section 14.71 for a more detailed discussion on 



^ HEALPix defines neighbouring pixels as ones which share a 
pixel face. However, due to the highly variable pixel shapes in 
the surface of the sphere, these are not necessarily the closest 
pixels to the central one. This occasionally results in HEALpix 
selecting two very close pixels as being separate peaks which in 
turn results in unexpected (but explainable) features in the first 
few bins. Hence we choose to ignore these bins (which fall into 
the first one after rebinning). The effect these extra peaks have 
at large angular scales was tested for and found to be negligible. 
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confidence levels. We present our all results using reduced 
values of with a total of 18 degrees of freedom. 



4 RESULTS 

We use the peak-peak correlation function in a number of 
different ways to investigate the properties of the maps (we 
use H for Hot, C for Cold, N for North and S for South): 

• The most obvious way is to conduct a full-sky analysis 
in the unmasked regions of the maps, which we do for hot 
and cold spots separately - £^H,i and ^c,i. 

• We also compute the peak-peak correlation function in 
each of the hemispheres individually, again looking at hot 
and cold spots separately in each case - ^NH,i, ^NC,i, ^SH,i 
and ^sc.i- 

• In addition we look at the difference of correlation be- 
tween the two hemispheres at a given angular scale and we 
define A(,H,i = £,SH,i — £,NH,i (similar for cold spots). 

• Finally, we take the average of the peak-peak correla- 
tion function in the Northern and Southern hemispheres in 
order to produce a computationally faster way to estimate 
the full-sky function - ^H,i (similarly for cold spots). 

We explain each of these in detail in the following sections. 
4.1 All-sky analysis 

We first consider all the hot spots above a certain threshold 
fo" (or cold spots below a negative threshold —i^o-), for the 
entire sky except for the masked regions of galactic plane 
and point sources. The results for a threshold of i/ = 1.5 are 
shown in Figure |21 However, with no specific type of non- 
Gaussianity in mind there is no particular reason to choose 
any given threshold - too high a threshold reduces the num- 
ber of peaks and the study is limited by cosmic variance, 
too low and the computation time is uncomfortably long. 
Our choice was a compromise to obtain a large number of 
extrema to get small errors, whilst not being computation- 
ally too expensive. We also plot the peak-peak correlation 
function averaged over 100 Gaussian maps and the error 
bars on the Gaussian curve are the errors on the mean. The 
small error bars show good convergence of the average of 
the peak-peak correlation function from the 100 Gaussian 
maps. Figure |5] shows the convergence of ^^r and with 
increasing number of maps. 

Although not optimally sampled, the structure we see 
at small angular scales is re al structure, as expected from 
iHeavens fc ShethI Jl999D and lHeavens fc Guptal j200lD 

We see immediately that neither the hot spots nor the 
cold spots follow the Gaussian simulations - the cold spots 
show an excess of correlation whereas the hot spots show a 
lack of correlation with respect to the Gaussian simulations. 
These differences are, however, not significant; one disad- 
vantage of the correlation function is that the errors can be 
highly correlated. The distribution of the values for all 
of the Gaussian maps can be seen in Figure^] together with 
the values for the WMAP data. We find = 1-25 for the 
hot spots and = 0-65 for the cold spots, which are within 
the Gaussian la confidence level. So the maps analysed in 
this way do not show any sign of non-Gau ssianity. This is in 
agreement with iLarson fc Wandel^ (l2005ll who also find no 
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Figure 2. The peak-peak correlation function of WMAP's data 
hot spots in the dashed (red) line and cold spots in the solid (blue) 
line. Simulated data (averaged over 100 Gaussian simulations) in 
the middle (black) line - the error bars shown are the errors on 
the mean. Note that as these are correlation functions, the errors 
are correlated. 
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Figure 3. Convergence of (solid, blue line) and 1;h (dashed, 
red line) with number of Gaussian maps used to estimate and 
di as defined in Section 3. 



significant deviation from Gaussianity when they compute 
the peak-peak correlation of hot and cold spots (although 
they work with lower resolution maps). 

There have been numerous claims in the literature (see 
Introduction) that the WMAP maps show an asymmetry 
in their statistical properties between the Northern and the 
Southern hemispheres, so we turn to this next. 

4.2 North-South analysis 

To further investigate any discrepancy between the WMAP 
data and our Gaussian simulations we apply the masks 30BN 
and SOBS to all our maps and proceed to get an estimation 
of the peak-peak correlation function for the Northern and 
Southern hemispheres separately. 

Figure |^ shows the peak-peak correlation function of 
the WMAP data for cold and hot spots calculated in the 
Northern and Southern hemispheres. We find a difference 
between the correlation of cold spots in the different hemi- 



6 Tojeiro et al. 



20 r 



15 - 



z 



5 - 



J] 



1 



0.5 



1.5 

2 

X 



2.5 



40 r 



30 - 



z 



10 - 



0.5 



1.5 

2 



2.5 



Figure 4. The distribution of reduced values for all of the 100 
Gaussian maps: hot spots in the dashed (red) line and cold spots 
in the solid (blue) line. The values for the WMAP data are 
represented by the small triangles and vertical lines. 



Figure 6. The distribution of reduced X%s ^^"^ Gaussian 
maps. Hot spots in the dashed (red) line and cold spots in the solid 
(blue) line. The X^s values for the WMAP data are represented 
by the small triangles and vertical lines. 




Figure 5. The peak-peak correlation for the WMAP data in the 
two hemispheres - solid lines show the South and dashed lines the 
North. The inner pale (red) lines show hot spots and the outter 
(blue) lines show cold spots. 



spheres. Again we use a statistic, Xns for and A^//, 
with the mean and covariance matrix estimated from 250 
Gaussian maps. By analysing each hemisphere seperately, 
we are reducing the number of peaks available for the es- 
timation of the peak-peak correlation function. Hence we 
found that a greater number of maps was needed to en- 
sure good convergence of the average peak-peak correlation 
function and of the covariance matrix. See Figure [T^ in the 
next section for convergence of some of the statistics with 
increasing number of maps. 

We calculate Xns for our ensemble of Gaussian maps, 
whose distribution can be seen in Figure |S] together with 
the Xns value calculated for the WMAP data, for hot and 
cold spots. 

We note that the fact we are finding the South-North 



difference not to be significant {x%s ~ 0.748 and 1.27 for hot 
and cold spots respectively) may be due to the fact that the 
peak-peak correlation function of threshold-selected peaks is 
highly sensitive to cosmic variance in the low multipoles. All 
the estimators are highly correlated and are shifted up and 
down in synchrony from Gaussian realisation to Gaussian 
realisation: the noisy low-^ multipoles can shift large num- 
bers of peaks above or below the threshold depending on the 
mode amplitude. This suggests that the use of a high-pass 
filter - effectively removing the signal from cosmic variance 
above a given angular scale on the sky - may be an efficient 
way to increase the sensitivity to non-Gaussian features. 

4.3 Constraining in harmonic space 

We high-pass filter our maps (real and Gaussian) 
by constructing several window functions given by 
WioutW = for £ < icut, Wi.^tW = 1 otherwise, which 
we apply to our maps and we investigate cases with 
lent = 0, 5, 10, 15, 20, 25, 30 and 40. 

We mask the WMAP maps before filtering. This is nec- 
essary because of the presence of foregrounds - the strong 
ringing effect in pixel space which results from such a sharp 
cut-off in harmonic space causes unwanted foreground sig- 
nal to leak from the masked region. We follow the algorithm 
described below: 

• We mask the WMAP data. 

• We extract the aim coefficients using HEALPix's rou- 
tine anafast. 

• We multiply the harmonic coefficients by the respective 
window function. 

• We generate the map in pixel space by using the 
HEALPix's routine synf ast. 

• We re-mask the map. 

• We remove the monopole/dipole from the unmasked 
regions. 

Since there is no foreground contamination in the Gaus- 
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Figure 7. Afc(^) for the WMAP map high-pass filtered with 
different values of Icut- 



sian maps, there is no need to apply the initial mask. For 
testing purposes, we applied both methods to a number of 
Gaussian maps and found them to produce the same final 
results. 

For each case we construct an ensemble of 250 Gaussian 
inaps. We compute A^//, A^c, (,nh, £,nc, ^sh, Csc, and 
for all our Gaussian maps as well as the WMAP data. 
Figure [T^ shows convergence of ^sc and A^c with number 
of maps in the solid and dashed lines respectively. The same 
plot also shows the convergence of the same statistics but 
this time calculated in a single-frequency Q-band map (see 
Section 4.5 for a single-frequency analysis). We quote all 
significance levels in Table |21 

Figure |7| shows A^c{d) for some different i^ut in the 
WMAP data. We note that the difference between the 
Southern and Northern hemispheres decreases as we remove 
more and more of the low order multipoles. This could be 
either due to the fact that cosmic variance alone is to blame 
for the North/South difference we see, or it could be due to 
the fact that whatever is causing this North/South differ- 
ence is intrinsically a large scale effect. 

We test the significance of each of these differences by 
using Xns- Figure|S]shows xl/s(^cut) for cold and hot spots. 
We plot the distribution of Xns using all the different £cut 
Gaussian maps - these maps are not strictly independent 
(although the statistics share the same underlying dis- 
tribution over all values of Icut) so we use only the 250 in- 
dependent maps at each Icut to draw conclusions about the 
significance of each detection - see Section 14.71 The added 
histogram over the 2000 maps can be seen in Figure 

We do the same test and construct identical plots for all 
our statistics: in/ic in Figure and ^nh /^nc /^sh /^sc 
in Figure [TTI The added histograms across all values of £cut 
for these statistics are very similar to that shown in Figure 

m 

The first point to make is that the non-Gaussianity is 
consistently absent at Icut = 40: there is no evidence from 
the peak-peak correlation function of non-Gaussianity on 
scales with £ > 40. 

The most significant non-Gaussian detections come 
from the cold spots in the Southern hemisphere, £,sc, at 
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Figure 8. X%s ^ ^ function of £cut for the WMAP data. Hot 
spots in the solid (red) line, cold spots in the dashed line (blue). 
The circle (blue) and the diamond (red) are the value (cold 
spots and hot spots respectively) for runs with the regions of sky 
within 30 degrees of the galactic plane removed (see Section 14.41 . 
We recall that the 99% confidence levels for a distribution of 
18 degrees of freedom come at x^ ^ 0.42 and ^ 1.93. See 
Figure |5] for the distribution of the X%s ™iues of all Gaussian 
maps. 

£cut = 10 (with a value of ~ 3.877 and a probability for- 
mally estimated to be lO"'^ - see Section l4.7^ . where we 
also find significant detections in the South-North differ- 
ence for cold spots, A^c {Xns ~ 2.302), and in the aver- 
age of Northern and Southern hemispheres for cold spots, 
(.c ix = 3.011). In addition to this, we have less significant 
detections at £cut = 20, 25 and 30 in isc (x^ = 2.747, 2.764 
and 2.756 respectively) and (x^ = 2.658, 2.923 and 2.601, 
see Figures [THl and im . All of these do not appear in a North 
minus South analysis. This could be simply because the sig- 
nal is not significant enough to show up in such analysis (we 
are roughly doubling the variance of our estimator by sub- 
tracing the data of the Sorthern and Northern hemispheres). 

4.4 Constraining in real space 

We further investigate the origin of this detection by re- 
moving extra regions near the masked galactic plane. We 
work on the maps where the significance of the signal is the 
strongest (those with £cut ~ 10), which we mask with the 
extended masks: bcut30_knorth and bcut30_ksouth. 

We proceed the same way as before and compute the 
full set of estimators: A^h, A^c, ^nh, ^nc, ^sh, Csc, |w 
and for all our Gaussian maps as well as the WMAP 
data and use the adequate x^ statistic for each of them to 
test the WMAP data for non-Gaussianity (we generate new 
random catalogues whose spacial distribution follows that 
of the new masks). 

Figures 151 1 1 01 and 1111 show how the new values com- 
pare with the ones previously obtained when we did not use 
any extra galactic cut - all values drop significantly to values 
which are perfectly consistent with the Gaussian hypothesis 
(the most extreme value being Xns ~ 1-46 for A^c) , in- 
dicating that our significant non-Gaussian detection in the 
cold spots is located within 30 degrees of the galactic plane. 
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Figure 9. The added distribution of X^g values for all our Gaus- 
sian maps at all different tcut- Hot spots in the dashed (red) line, 
cold spots in the solid (blue) line. Similar histograms were pro- 
duced for all of our other statistics, and all show an identical 
added distribution of reduced values. 




^cut 

Figure 11. x^i^cut) for ^jvc (blue dotted line), fjvH (red dot- 
dashed line), S^sc (blue solid line) and 6,sh (red dashed line) for 
the WMAP data. The single point at taut = 10 are the values 
for runs with the regions of sky within 30 degrees of the galactic 
plane removed (see Section l4.4l : £,nh in the red circle, ^jvc in the 
blue square, S^SH in the red triangle and £^sc in the blue cross. 
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Figure 10. x^i^cut) for solid (blue) line and dashed (red) 
line for the WMAP data. The single points at taut = 10 are the 
X^ values for cold spots (blue cross) and hot spots (red circle) 
in runs with the regions of sky within 30 degrees of the galactic 
plane removed (see Section 14.41 . 
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Figure 12. Convergence of some of our statistics which yielded 
detections of non-Gaussianity with increasing number of maps. 
For the QVW map we show in the solid line and A^,^ in the 
dashed line. For the single-frequency Q-band map we show gsc 
in the dotted line and A^c in the dot-dashed line 



Clearly, this hints at residual foreground contamination as- 
sociated with the Milky Way. 

We note that we have only tested this on maps with 
^cut = 10 since this is where we have found our strongest 
detection. We cannot discard the possibility that the effect 
that yields detections on maps with Icut = 15, 25 and 30 is a 
different effect altogether which does not lie in the galactic 
region. 

4.5 A single-frequency analysis 

To check whether the non- Gaussian signal we detect is re- 
lated to possible residual foregrounds in the WMAP data 
we conduct a single frequency analysis of the maps. In- 



deed, the expected galactic foreground contribution to the 
WMAP maps consists mainly of synchrotron, free-free and 
dust emission. All of these effects are frequency-dependent 
and obviously non-Gaussian. If any foreground residuals are 
still present in the foreground-cleaned data then we would 
expect them to contribute differently to each of the different 
frequency maps. We note that any residual noise may also 
contribute differently to each frequency. 

We construct the real map and each of the 250 simu- 
lated single frequency maps by following the same method 
we used to construct the co-added QVW map (described 
in Section I2.1II . but we use only the relevant DAs for each 
of the frequencies. We then smooth the WMAP and Gaus- 
sian maps with a 12 arc minute FWHM Gaussian beam and 
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Figure 13. x|^rg for all three frequencies: Q (41 GHz), V (61 
GHz) and W (94 GHz) on maps with (.cut = 10. Statistics for 
cold spots in the solid (blue) line, for hot spots in the dashed 
(red) line. 
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Frequency/GHz 

Figure 15. for all three frequencies: Q (41 GHz), V (61 GHz) 
and W (94 GHz) on maps with Icut = 10. Statistics for gj/ in the 
dashed (red) line and for E^c in the solid (blue) line. 




Frequency/GHz 

Figure 14. for all three frequencies: Q (41 GHz), V (61 GHz) 
and W (94 GHz) on maps with lent = 10. Statistics for ^jvh in 
the dot dashed (red) line, ^jvc in dotted (blue) line, €,sh in 
the dashed (red) line and finally for E^sc in the solid (blue) line. 



high-pass filter with a lent = 10 window function (where we 
had the most significant non-Gaussian detection). 

We calculate the full set of estimators for each of the 
frequencies: A^h, A^c, ^nh, ^nc, ^sh, Csc, in and |c, for 
which the values can be seen in Figures lOl 1141 and [T^ 

We find significant non-Gaussian signals coming from 
the cold spots A^c in the Q band {xirs = 2.081) and ^sc 
in all three bands, although it is strongest in the Q band 
(with = 3.831). We also find detections in our full-sky 
estimates in the cold spots in all three bands, and, for the 
first time, in the hot spots in bands Q and W (see Figure 

We may be seeing a frequency-dependent type of non- 
Gaussianity, although we can not put aside the possibility 
of a cosmological origin. To improve readability we do not 
present the plots with the distributions of the 250 Gaus- 



sian maps for each of the frequencies and for each of the 
estimators. We do, however, quote the number of Gaussian 
maps with a Xoaussian > XwMAP for all significant detec- 
tions in Table II Section lO 

4.6 Removing the cosmological signal 

In order to investigate the possibility of any contributions 
from foregrounds or unexplained noise properties, we remove 
what is taken to be the cosmological signal from our analysis. 

To do so we subtract different single-frequency maps to 
produce three maps which contain only a mix of subtracted 
residual foregrounds (if any) and noise. We produce aV — Q, 
a.V — W and a,Q — W map, which are simply a pixel- by-pixel 
subtraction of each of the single frequency maps, constructed 
as described in Section f4.5l 

With the cosmological signal removed, the detailed 
noise properties of these 3 subtracted maps at large angular 
scales now become important for our analysis and one should 
be careful when constructing equivalent Gaussian maps (see 
Section 2.1.2). We therefore take a slightly different route to 
construct the Gaussian simulations with which we compare 
the WMAP data, and we now make use of the 110 noise 
simulations supplied by the WMAP team. We construct 
single-frequency noise maps by adding the respective indi- 
vidual radiometer simulations following the same weighting 
scheme as described in Section 12.11 which we then smooth 
and high-pass filter with a Icut ~ 10 window. We then sub- 
tract different frequency noise maps in order to produce 110 
simulations with which we compare our real V — Q, V — W 
and Q — W maps. We reemphasize that for maps which 
include the signal, the non-white nature of the noise at low- 
l is essentially irrele vant, as the signal dominates entirely 
jHinshaw et alJl2003t) . 

We construct A(,h, A^c, ^nh, ^nc, ^sh, Csc, in and 
for the simulations and the real data as before and use the 
respective x^ statistic to probe for non-Gaussian signatures. 

In this case, our total number of maps was constrained 
by the number of noise simulations provided by the WMAP 
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Figure 16. Convergence of the reduced values for A^h i 
CiVffi ^JVCi isH and £^sc in the Q — W map as a function of 
number of Gaussian maps. 
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Figure 17. ^jvc B.nd £^sc estimated from the V — Q subtracted 
maps. WMAP's data are the solid line for Southern hemisphere 
and dashed line for Northern hemisphere (both in blue). Gaussian 
averaged data in dotted line for Southern hemisphere, dot dashed 
line for Northern hemisphere (both in black). 



team. Figure IT?)1 shows how the reduced values for A^_f/, 
A^c, ^NH, Civc, Cs-H and ^sc in the Q - W map change 
with number of simulated maps used (the Q — V and V — W 
maps produced very similar results) . The results show clear 
convergence to some value well within the la confidence lev- 
els. The reason why we observe faster convergence in these 
maps could simply be due to the fact that we are removing 
the cosmological signal from the analysis and with it much 
of the variance. 

Figure IT7I shows fjvc and ^sc for the WMAP data and 
also and where the average is done over the 110 
simulated V — Q noise maps. 

Some comments on this figure are appropriate. Firstly 
we note that there is an intrinsic North/South asymmetry 
in the Gaussian noise maps. This is due to the large-scale 
structure generated in the noise due to the uneven scanning 



pattern of the WMAP satellite. We recall that pixel noise is 
weighted according to the number of times a pixel has been 
observed, and as such this feature is fully simulated in all 
our previous maps. This large-scale structure combined with 
the fact we are applying an asymmetric mask to the data 
results in the non-zero and North/South asymmetric peak- 
peak correlation function we see. We draw attention to the 
fact that this asymmetry is qualitatively different from what 
we found in Sections 4.2, 4.3, 4.4 and 4.5, since we now find 
an excess in correlation in the Northern hemisphere, as op- 
posed to in the South'*. This excess in correlation in the 
North is indeed seen in the Gaussian-averaged peak-peak 
correlation function of all our previous maps, although on 
a much smaller scale. Finally we note that there is a more 
noticeable deviation of the WMAP data from the Gaussian 
simulations in the Southern hemisphere. However, we find 
none of these to be significant. In fact, this statement ex- 
tends to the other two cases: V — W and Q — W. We find 
no signs of non-Gaussianity in any of the estimators in any 
of our combined noise and foreground maps, with all the 
values well within values which are consistent with the 
Gaussian hypothesis (our most extreme value comes from 
^SH in the V — Q map, where we find x^ = 1-49 - see Table 
2 in the next section for a summary of the most extreme 
values in all three maps) . 

4.7 A summary of our results 

In this subsection we take the opportunity to summarise 
our results into one table and to elaborate on the confidence 
levels we have quoted throughout the paper. We do this by 
presenting a table with all the statistics for which we have 
found the WMAP data to have a reduced Xwmap ^ 2, 
Table H 

We recall that in all cases we have rebinned the data 
into 19 linearly-spaced bins, of which we use the last 18 
to compute each of the x^ statistics. The Ptheory column 
gives the probability of randomly obtaining a given value 
of ^ Xwmap assuming the underlying distribution 
is a x^ distribution with 18 degrees of freedom, and the 
Ncaussian columu shows how many Gaussian maps have a 
^ Xwmap for correspondent estimator (the num- 
ber in brackets in the total number of Gaussian maps) . It is 
worth noting that the x^ distribution we estimate from the 
Gaussian maps fits a x^ distribution with 18 degrees of free- 
dom which has been shifted slightly by A^^ « 0.1 to lower 
values. Hence any limit on high values of x^ based on this 
theoretical distribution is a conservative one. These are the 
limits we quote throughout the text. Shifting the Gaussian 
X^ distribution by Ax^ = 0.1 results in the Ptheory values 
in Table 2 roughly being halved. 

We draw attention to our most striking detections, 
which come from the cold spots in the Southern hemisphere, 
appearing both in the co-added QVW map and in the single 
frequency Q band map with reduced values of 3.877 and 
3.831 respectively. 

* As a sanity test, we have also performed an identical analysis 
on purely white noise maps which include the WMAP's satellite 
scanning pattern and found them to have the same North/South 
asymmetry behaviour. 
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Table 2. Our main detections. We present all situations which yielded vales of > 
2. In addition to this and for the sake of completeness we also present the most 
extreme values obtained in Section 14.61 Ptheory is the theoretical probability of 
randomly obtaining a reduced ^ ^WMAP assuming a reduced distribution 
with 18 degrees of freedom and NQausaian is the total number of Gaussian maps 
with x^ ^ ^'wMAP^ brackets is the number of Gaussian realisations used for each 
statistic. 
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5 CONCLUSIONS 

In this paper we have used the peak-peak correlation func- 
tion of the local extrema in the CMB temperature fluctua- 
tions to probe for non-Gaussian signatures. As explained at 
the start of Section 4, we have constructed a series of statis- 
tics which look both at cold and hot spots in the full sky, 
in the Northern and Southern hemispheres separately and 
at the difference between both hemispheres. We have also 
looked at a variety of maps: a co-added QVW map with 
a standard mask applied, a co-added QVW map with an 
extended galactic cut applied, three co-added maps at the 
same frequency and three differences maps so as to remove 
the cosmological signal. In the case of the co-added QVW 
map we also investigated eight different cases with some of 
the low multipoles removed. 

Our main results are summarised in Table |5| in Section 
14.71 - we find strong evidence for non-Gaussianity, mainly 
associated with the cold spots and with the Southern hemi- 
sphere; this non-Gaussianity disappears completely if we fil- 
ter out the harmonic modes £ < 40 and at least partially if 
we exclude sky within |&| < 30°, so it is a large-scale effect 
associated with the galactic plane. 

Recently, iLarson fc Wandel^ (l2005l) have also used the 
peak-peak correlation function of cold and hot spots in their 
search for non-Gaussianity. Direct comparison of results is 



not straightforward as the resolutions of the maps used in 
the two studies are significantly different. However, in the 
simplest case where both groups looked at the full sky CMB 
temperature field (with equivalent masks based on the stan- 
dard kpO mask applied), both results are in agreement in 
the sense that both fail to yield a detection. We believe this 
lack of detection is a result of large cosmic variance in low-^ 
multipoles. 

We investigate this further by removing some of the 
low order multipoles from the maps, in the hope that by 
doing so we are increasing our sensitivity to non-Gaussian 
features by reducing the effects of cosmic variance. Once 
we remove all harmonic modes with ^ < 10 we systemati- 
cally find anomalies related to the cold spots in the WMAP 
data and, when looking at both hemispheres separately, we 
not only find a striking North/South asymmetry, we re- 
peatedly find the strongest anomalies to be in the South - 
ern hemisphere. This is not unheard of: IVielva et alJ (|20o3) 
first found an anomalous large cold spot in the Southern 
hemisph ere (nicknamed The Spot) , detection wh i ch wa s fol- 
lowed by'Cruz et al.' ^'2005^. lMukheriee fc W ang (2004) and 
iMcEwen e_t al, (2005) and confirmed repeatedly. However, 
we do find that our detections disappear when we exclude 
sky regions within 30 degrees of the galactic place (we recall 
that The Spot is localised at approximately {b — —57° , I — 
209° ) , well outside our cut regions of sky) . We therefore con- 
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elude that our detections come mainly from something other 
than The Spot. 

The North/South asymmetry is also something which 
has been q u oted time and time a gain i n the literature: 
IParkI j2004li . lEriksen et alJ ll2004allbl l2005h and other work 
previously quoted in this paper have consistently found 
non- Gaussian asymmetries between the two galactic hemi- 
spheres. The asymmetry we find in this study seems to be a 
large scale effect, once again related only to the cold spots 
and to be contained within 30 degrees of the galactic plane. 

We investigate our detections further by firstly conduct- 
ing an analysis in single frequency maps. We find some ev- 
idence for a dependence of the signal with frequency when 
we look at different hemispheres (peaking at 41GHz, corre- 
spond ing to the Q band and in agreement with lLiu fc Zhaii3 
(|200a)), but this detection does not appear in a full-sky anal- 
ysis. Secondly we remove the cosmological signal from the 
analysis by subtracting different frequency maps and testing 
the resulting foreground/noise combination maps for non- 
Gaussian signals. We find no signs of non-Gaussianity in 
these subtracted maps. 

How do we make sense of these results? A simple ex- 
planation seems untenable. The fact that the signal becomes 
insignificant when the galactic plane is removed suggests un- 
subtracted galactic foregrounds are responsible; the large- 
scale nature of the signal is certainly consistent with this 
picture. One would then expect the individual frequency 
maps to show a significant signal, and this we do find, most 
strikingly in the Q band. Ifowever, the difference maps do 
not show a significant detection; these maps should directly 
test the residual foregrounds and noise, so the absence of 
detected non-Gaussianity does not obviously support this 
picture. We can reconcile these observations if the residual 
foregrounds affect more than one frequency band, and the 
subtraction removes the contamination to some extent. The 
fact that we find non-Gaussianity in all the single-frequency 
bands adds some support to this complex picture. In our 
view this is the most likely explanation for the results we 
find, but we cannot exclude a primordial origin for at least 
part of the non-Gaussian signal. 
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